An algebraic semigroup method for discovering maximal frequent itemsets
نویسندگان
چکیده
Abstract Discovering maximal frequent itemsets is an important issue and key technique in many data mining problems such as association rule mining. In the literature, generating proves either to be NP-hard or have O ( l 3 4 m + n ) O\left({l}^{3}{4}^{l}\left(m+n)) complexity worst case from perspective of complete bipartite graphs a graph, where m , n are item number transaction number, respectively, l denotes maximum ∣ C Ψ / − 1 | C| \Psi \left(C)| \hspace{0.1em}\text{/}\hspace{0.1em}\left(| +| -1) with taken over all C . this article, we put forward method for discovering itemsets, whose 2 β O\left(3mn{2}^{\beta }+{4}^{\beta }n) lower than known both case, semigroup algebra, \beta items support more minimum threshold. Experiments also show that algorithm based on algebraic performs better other three well-known algorithms. Meanwhile, explore some properties respect transactions, prove exactly simplified generators give necessary sufficient condition i i+1 -frequent itemset being subset closed i itemset, provide recurrence formula itemsets.
منابع مشابه
Discovering Frequent Closed Itemsets for Association Rules
HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau...
متن کاملEfficiently Mining Maximal Frequent Itemsets
We present GenMax, a backtrack search based algorithm for mining maximal frequent itemsets. GenMax uses a number of optimizations to prune the search space. It uses a novel technique called progressive focusing to perform maximality checking, and diffset propagation to perform fast frequency computation. Systematic experimental comparison with previous work indicates that different methods have...
متن کاملA comprehensive method for discovering the maximal frequent set
The association rule mining can be divided into two steps.The first step is to find out all frequent itemsets, whose occurrences are greater than or equal to the user-specified threshold.The second step is to generate reliable association rules based on all frequent itemsets found in the first step. Identifying all frequent itemsets in a large database dominates the overall performance in the a...
متن کاملMaximal Frequent Itemsets Mining Using Database Encoding
Frequent itemsets mining is a classic problem in data mining and plays an important role in data mining research for over a decade. However, the mining of the all frequent itemsets will lead to a massive number of itemsets. Fortunately, this problem can be reduced to the mining of maximal frequent itemsets. In this paper, we propose a new method for mining maximal frequent itemsets. Our method ...
متن کاملHigh Performance Mining of Maximal Frequent Itemsets
Mining frequent itemsets is instrumental for mining association rules, correlations, multi-dimensional patterns, etc. Most existing work focuses on mining all frequent itemsets. However, since any subset of a frequent set also is frequent, it is sufficient to mine only the set of maximal frequent itemsets. In this paper, we study the performance of two existing approaches, Genmax and Mafia, for...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Open Mathematics
سال: 2022
ISSN: ['2391-5455']
DOI: https://doi.org/10.1515/math-2022-0516